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The first step when forming the polynomial hierarchies of languages is to consider languages of the 
form KaL where K and L are over a finite alphabet A and from a given variety 'f of languages, a G A 
being a letter. All such KaUs generate the variety of languages BPoli ('^). 

We estimate the numerical parameters of the language KaL in terms of their values for K and 
L. These parameters include the state complexity of the minimal complete DFA and the size of the 
syntactic monoids. We also estimate the cardinality of the image of A* in the Schiitzenberger product 
of the syntactic monoids of K and L. In these three cases we obtain the optimal bounds. 

Finally, we also consider estimates for the cardinalities of free monoids in the variety of monoids 
corresponding to BPoli(l^) in terms of sizes of the free monoids in the variety of monoids corre- 
sponding to y. 

1 Introduction 

The polynomial operator assigns to each variety of languages Y the class of all Boolean combinations 
of the languages of the form 

LQa\L\a2 ■ -.aeLe , (*) 

where A is a finite alphabet, ai,...,ai G A, Lq, . . . ,Li G y{A) (i.e. they are over A). Such operators 
on classes of languages lead to several concatenation hierarchies. Well known cases are the Straubing- 
Therien and the group hierarchies. Concatenation hierarchies has been intensively studied by many 
authors - see Section 8 of the Pin's Chapter [4 |. In the restricted case we fix a natural number k and we 
allow only £ <km {*) - see IH and papers quoted there. The resulting variety of languages is denoted 
by BPol|t('^). Using the Eilenberg correspondence, BPolj- operates also on pseudovarieties of monoids. 
We consider in this paper only the case k = I. 

State complexity problems are a fundamental part of automata theory. Recent papers of a survey 
nature with numerous references are HI by Brzozowski and HI by Yu. First we estimate the state 
complexity of DFA automata for the language KaL in terms of the state complexities of K and L. This is 
the content of Section 2. 

Secondly, for languages K and L, we also estimate the cardinality of the image of A* under the natural 
homomorphism Ha into the Schiitzenberger product of the syntactic monoids M and of the languages 
K and L. This monoid /Xa(A*) recognizes the language KaL, too. The syntactic monoid of KaL is a 
homomorphic image of the monoid /x„(A*). The third question concerns its cardinality. 

In all three problems we get estimates which can be reached by concrete examples (for the first one 
in Section 2 and for the two remaining ones in Section 3). In general: the size of the Schiitzenberger 
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product equals at least to the size of the monoid /^^(A*) which is at least the size of the syntactic monoid 
of KaL. In Section 3 we further consider natural examples showing that those three numbers could differ 
drastically. The first example is the language B*aC*, B,C C A. The next proposition roughly estimates 
jUq(A*) for ^-trivial monoids using their structure. 

In the last section we consider a variety of languages Y such that the corresponding pseudovariety 
of monoids consists of all finite members of a locally finite variety of monoids V. Then the free monoid 
F\{A) in V over a finite set A is the smallest one recognizing all languages in y{A). We embed the 
free monoid in the variety of monoids corresponding to the class BPoli ('^) over A into the product of |A| 
copies of the Schiitzenberger product of Fy{A)OFy{A) which leads to a rough estimate for the cardinality 
of this free monoid. 

2 Recognizing by Automata 

Let A be a finite alphabet and let L C A* be a regular language. The following construction of the minimal 
complete DFA is due to Brzozowski. We put: D = {u^^L | m G A* } - the set of all left derivatives of L 
(here m^^L = { v G A* | mv G L}). One assigns to L its "canonical" minimal automaton = {D,A,-,L,F) 
using left derivatives; namely: 

• D is the (finite) set of states, 

• a G A acts on u^^L by {u^^L) ■ a = a^^ (u^^L), 

• Lis the initial state and 2 G D is a final state (i.e., element of F ) if and only if 1 G 2. 

Proposition 1. Let K and L be languages over a finite alphabet A whose minimal complete DFA have 
k resp. I states and let a G A. Then the minimal complete DFA for the language KaL has at most k2^ 
states. 

Proof. Notice that an arbitrary left derivative of KaL is of the form 

{u^^K)aLUu^^LU • • • U u^^^L, m a non-negative integer, ui,. . . ,Um G A* . 

We have k possible values for u^^K and u^^LU ■ ■ - U u^^L, m < i, has at most 2^ values. The statement 
foUows. □ 

The example in the next proposition is a slight modification of the construction in Theorem 2.1 in 
||6]| . It was suggested to the authors by J. Brzozowski. It shows that the bound from Proposition 1 is tight. 

Proposition 2. For arbitrary natural numbers k,£ >2 there exist languages K resp. L whose minimal 
complete DFA have k resp. i states such that each complete DFA recognizing the language KaL has at 
least k2^ states. 

Proof. Let A = {a,b,c} and let iz/ = {{po,... ,pk-\},A, ■,po, {pk-i}) where 

Pi ■ a = pq, Pi ■ b = pji where /' = / + 1 (mod k), prc = Pi for all / = 0, ... ,^ — 1 . 
Note that accepts the language 

K= L(^) = {uv I u G (A*a)*, v G {^,c}*, and \v\h = k-l (mod A:)} . 
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Similarly, we define ^ = { {^o, • ■ • ,qi-i}A, -,^0, {qe-i}} where 

qj-a = qf where / = j + l (mod £), qj-b = qj, • c = for all j = 0,...,£—l. 
Clearly, for L = L(^), we have 

Ln{a,by = {ue{a,by \ \u\a^i-l (modi)} . 

Both automata ^ and ^ are minimal. 
We define, for all u G {a,b}*, the set 

S{u) = { / G {0, . . . , £ — 1 } I M = vaw such that v £ K and i= \w\a (mod £) } 

and the numbers 

T{u) = the greatest m such that b"" is a suffix of m 

and 

t{u) e {0,...,A:- 1}, t{u) = T{u) (mod A;). 

Let M,v G {a,b}* be such that S{u) 5(v). Let s G 5(m) \5(v) (the case s G 5(v) \5(m) can be treated 
similarly). Then ua^~^~^ G KaL but va^~^~'' ^ X'aL. Then in each complete DFA recognizing KaL with 
the initial state ro we have that 

ro-w/ro-v. 

Now let u,v G {a,b}* be such that ^(m) = 5(v) and t = t{u) > t{v). Then, for w = cb'^^^^'a^, we 
have uw G ^TaL and vw ^TaL. Again, in each complete DFA recognizing KaL with the initial state ro 
we have that 

ro-u^ro-v. 

For an arbitrary subset S = {si,. .. ,Sm} of {0,. .. ,i — I}, where si > ■ ■ ■ > Sm, and f G {0, 1 } there 
exists a word 

u = b'^-^a'^-'^b'^-^a'^-'^b''-^ . . . b'"'^ a'"'^^ b' 

such that S{u) = S and t{u) = t. 

Therefore each complete DFA recognizing KaL has at least k2^ states. □ 

3 Recognizing by Monoids 

Let K and L be languages over a finite alphabet A and let M and be their syntactic monoids. In this 
section we will compare 

• the size of the Schiitzenberger product MON of monoids M and A^, 

• the cardinality of the image of A* in the homomorphism from A* into MON recognizing the 
language KaL, 

• the size of the syntactic monoid of the language KaL. 
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Let M and be finite monoids. Their Schiitzenberger product MON is tlie set of all 2 x2 matrices P 
where P2,i = 0, A,i € M, P22 S A'^ and Pi 2 (^M xN equipped with the multipUcation 

{PQ)i,i = (^2)2,2=^2,222,2 and 

{PQ)U2 = {{Puix,y) I (x,);) € 61,2} U{ (2,^22,2) | ^ ^1,2} • 

It is well known that this operation is associative. This product was introduced by Schiitzenberger and 
by Straubing for an arbitrary finite family of monoids. Basic results are also due to Reutenauer and Pin 
- see S Theorems 1.4 and 1.5 in Chapter 5. Clearly, if \M\ = m and |A^| = n, then |MOA^| = mnl"'". 

Recall that the syntactic congruence of the language /? C A* is a relation ~s on A* defined by: 

u^rv if and only if {M p,q ^A* ) { puq ^ R <^ pvq G /? ) . 

The syntactic monoid of R is the quotient monoid A*/~s. It is the smallest monoid recognizing the 
language R. 

Let A be a finite alphabet and let <p : A* — ;> M, i//^ : A* — ;> A/^ be homomorphisms. Let S CM, T CN 
and let K = q)^^{S), L = y^^{T), i.e. the language K is recognized by M using (p and S, and similarly 
for the language L. One can take the mappings cp and y surjective. 

For a € A, we define a mapping Ha '■ A* — )• MON by 

{Ha{u))ui = (p{u), {lXa{u))2.2 = and 

{l-iaiu))i,2 = { 1 " = u'au" , u ,u" G A* } . 

It is easy to see that it is a homomorphism and that the language KaL is recognized by MON using Ha 
and { P e M OAT I Pi ,2 n 5 X r 7^ } . 

Of course, the language KaL is also recognized by /^^(A*) which can be much smaller than the 
whole MON. Moreover the syntactic monoid of the language KaL is a homomorphic image of the 
monoid jU^(A*). Its size can be much smaller than the cardinality of the monoid jUa(A*). 

First we present, for arbitrary m and n, an example where the mapping jXa is onto. Thus the bound 

mn2"^" for /Xo(A*) is sharp. 

Proposition 3. For arbitrary m and n, there exist languages K and L with syntactic monoids M and N 
and homomorphisms (p : u^ u^^k, ^ '■ u^ u G A*, such that the mapping jXa '■ A* — ^ MON is 
surjective. 

Proof. Let A = {a,b,c}, let m and n be natural numbers and let 

K = {u ^ A* \\u\b =0 (mod m) } and L = {m € A* I = (modn)}. 

The syntactic monoids of K and L are the additive groups Z„, and Z„ and the syntactic homomorphisms 
are given by 

(p{u) = [\u\b\,„ G Z,„ and i/a(m) = [\u\,]n G Z„ . 

Let it G {0, ... ,m- 1},^ G {0,... 1} and O C {0, . . . ,m - 1} x {0, . . . ,« - 1} be arbitrary. We will 
find M G A* such that 

\u\b = k (mod m), \u\c = £ (mod n), and 
{(/7,^) G{0,...,m-l}x{0,...,?i-l} I 
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W\h = P (mod m), \u"\c = q (mod «), u = u'au" , u,u" G A* } = O. 
= {(0,70,1 ),..., (0,7o,po),(l, 71, i),...,(l,7i,pi)> 
(m- l,7„,_i,i),...,(m- l,7„,_i,p,„_,)}, 
n-\> 70,1 > • • • > 7o,po > 0, 1 > 7,„_i,i > • • • > 7m-i,p„-i > 0. 
uq = c"^j° 'ac^° '^j°-a . . . ac^°"o-i^jo-poacj°-"o , 

where Ui = 1 if = 0, for / = 0, . . . ,m — 1. Finally, putting 

u = c^uobu[b . . . Um-\b ■ M , 
we see that this word has all desired properties. □ 

We used GAP to calculate the sizes of syntactic monoids from the last proof for m G {2,3,4} and 
n = 2. The numbers were 61, 379 and 2041. They are of the form mn{2"™ — 1) + 1. This led us to the 
following two results. 

Proposition 4. Let K and L be languages over a finite alphabet A with syntactic monoids M and N, let 
\M\ = m, \N\ = n and let a Then the size of the syntactic monoid of KaL is at most mn{2'"" — 1) + 1. 

Proof (i) Suppose first that both M and A/^ are groups. Let m G A* be such that (/ia(M))i,2 =M xN. Then 
also, for each p,q G A*, it is the case that {iia{puq))i2 = M xN. Therefore, each pair (m, v) G A* x A* 
with (/ia(M))i,2 = (Ma (^)) 1,2 = M X A^^ is in the syntactic congruence of the language KaL. 

(ii) Suppose that the monoid M is not a group (the case N not being a group could be treated in a 
similar way). Let s £ M be without an inverse element. Then there is no f G M with st = I. Indeed, 
such t would imply that u i-^ us, u £ M, is one-to-one and due to the finiteness of M we have that 
{us\u£M} = M. Thus there would be m G M such that us=\ and t = us-t = u-st = u-n contradiction. 

Let u G A* and let {s, 1) G {pia{u))\;i. Thus there exist u' ,u" G A* such that u = u'au" and (p{u') = 
s, ^{u") = 1. Consequently {iXa{u))\j = (p{u) = s(p{a)(p{u") / 1. There are mnl'""^^ matrices inMOA'^ 
not having the element {s, 1) in the set at position (1,2), and (m— 1)^2™"^' matrices in MOA/^ having the 
element {s,l) in the set at position (1,2) and not having 1 at position (1,1). 

Consequently, the size of the syntactic monoid of KaL is less or equal the cardinality of iUa(A*) which 
is at most mnl'""^^ + (m — l)?i2'""^^ The gap between mnl'"" and the last number is at least the needed 
value mn — 1. □ 

Next we show that the estimate from Proposition |4] is exact. 

Proposition 5. For arbitrary m and n, there exist languages K and L with syntactic monoids M and N, 
\M\ = m, \N\ = n, such that the size of the syntactic monoid of KaL is exactly m?i(2'"" — 1) + 1. 
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Proof. We again consider the languages K and L from the proof of Proposition |3] 

(i) Let M, V G A*, {[k]^, [£]n) G (jU«("))i,2 \ {l^a{v))i.2, k£{0,...,m-l},i£{0,...,n- 1}. 
Let 

p = ^ = c"-^ 

Then puq € ^aL and pvq i^aL. 

(ii) LetM,v gA*, M«) ^ (Ata("))i,2 = (Ai«(v))i,2, ke{0,...,m - 1}, 

£ G {0, 1}. Let (/ia(M))ij / (/ia(v))ij. (The case {}Xa{u))2,2 / (Ma(v))2,2 could be treated analo- 
gously). 

Let p = b'"^^, let a be a natural number such that j8 = am — \b'"^'^u\h > 0, and let q = c'^^%^a. 
Then puq G KaL and /jv^ i^aL. □ 

The following example shows that the cardinalities of MON, the cardinality of /^^(A*) and the size 
of the syntactic monoid can be three quite different numbers. 

Example 6. Let a G A, let B,C ^ A and consider the language B*aC*. Syntactic monoids of both B* and 
C* are isomorphic to the two element monoid 2 = {0, 1} having a neutral element 1 and a zero element 
0. Moreover, for a G A, <p(a) = 1 if and only if a G B, and iff (a) = 1 if and only if a G C. Finally 
S = T = {\}. 

Clearly, the cardinality of 202 is 2 • 2 • 2^'^ = 64. 

Let A = {a,b,c,d}, B = {a,b} and C = {a,c}. One can calculate that |/ia(A*)| = 22. Finally, it is 
well known and easy to see that the syntactic monoid of B*aC* is isomorphic to the 8-element monoid 
of Boolean uppertriangular matrices of order 2. 

We will try to estimate the number I using the structures of monoids M and A^. The first little 

step concerns very special monoids and certain chains of their elements. 

Green's relations are a basic tool in semigroup theory: define on an arbitrary monoid O the qua- 
siorders <^, <^ and < ^ as follows: 

P ^ iff = qr for some r, p <^ qiff p = sq for some s , 

and p < jf qi& p = sqr for some r, s . 

A monoid O is ^ -trivial p < / q </ p implies that p = q. For each u G A*, we define c(m) (the 
content of u) as the set of all letters of u. 

Proposition 7. Let M and N be finite ^ -trivial monoids having cardinalities m and n. Let the number of 
elements in a longest strict <ag-chain in M is p and the number of elements in a longest strict <y-chain 
in N is A. Let a G A, (p : A* — ?> M, : A* ^ N be homomorphisms. Then the number of elements of each 
set of {Ha{u))\2-, M G A*, is less or equal to p — \ (which is < m -\- n = \}. In particular, 

I /-*M ^ .fmn\ fmn\ ( mn \. 

l^«(^)l^'""Vj + (lj+-+(p + A-lj)- 

Proof. Let u = uoauiau2 ■ ■ -au^ where a c(mo),c(mi), . . . ,c{uk). Then 

(Ma(«))i,2 = { {q){uQ),Yiuiau2 ■ ..auk)), {q){uoaui),Yiu2au3 . ..aut)),. . ■ 



and the statement follows. 



. ,{(f){uQaui . . .auk-\),^f{uu))} 



□ 
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The following example shows that the bound for |(/ia(M))i,2| from Proposition |7] is sharp. 

Example 8. For B C A we write B = {u £ A* \ c{u) = B} . Notice first that, for B ^ a, the syntactic 
monoid of B is isomorphic to the monoid (2^, U) with a zero adjoined. The syntactic homomorphism 
<p maps M G B* onto c{u) and (p{u) = 0, otherwise, and we have S = {B}. All the relations < j 

coincide with the reverse inclusion Similarly for C ^ A. 

Consider first the language BaC for A = {a,Zj,c}, B = {a,^}, C = {a,c}. Then p = A = 4 and 
p + A - 1 = 7 and 

{Ha{aababacacaa))i^2 = { (0,0), ({a},0), (B,0), (B,C), (0,C), (0, {a}), (0,0) } . 

We can modify this example for arbitrary p, A > 4 as follows: 

B = {a,bu...,bp-3}, C = {a,ci,...,c;i„3}, A = BUC. 

Then 

{lJ.aiaabiab2a . . .abp_^abiaciac^_^a . . .ac2ac\aa)\ 2 = 
= {(0,O),(M,O),({a,Z7i},O),({a,Z7i,ft2},O),...,(B,O),(B,C),(O,C),... 
. . . , (0, {a,ci ,C2}), (0, {a,ci })(0, W), (0,0) } . 

4 Level of Varieties 

Let Y he. n variety of languages. A well known fact is that the pseudovariety of monoids corresponding 
to the class BPol(V) = U/t>o BPol/:(V) is generated by all Schiitzenberger products 0(Mo, . . . ,M„) where 
Mo, . . . ,M„ are syntactic monoids of languages from Y - see ([3], Theorems 5. 1.4. and 5. 1.5.). Of course 
being interested in BPol<:(V), one takes n = k. 

Here we are looking for a single finite monoid recognizing all languages in ^(A), A fixed. We can 
succeed under certain circumstances as follows. Let V be a locally finite variety of monoids, i.e. the 
finitely generated monoids in V are finite. Let ~ be the corresponding fully invariant congruence on 
X*, X = {xi ,X2, ■■■}, i.e. the set of all identities which hold in V. Notice that X* is the free monoid 
in V over the set X. The finite members of V form a (the so-called equational) pseudovariety of finite 
monoids. We denote the corresponding variety of languages by Y , i.e. L € Y{A) if and only if the 
syntactic monoid of L is a member of V. Then the free monoid in V over the set A is the smallest monoid 
recognizing all languages from ^{A). Thus we consider somehow the descriptional complexity for the 
whole varieties of languages. 

One of the main results of was an effective description of the fully invariant congruence for 
the variety BPoljt(V). Here we treat only the case of ^ = 1. 

Result 9 ((in, Theorem 1)). For m, v G A*, we have 

u if and only ifu ~ v and for each a € A, 

( / I II \ \ I II I II ^ ( / I II \ I I II I II ^ A* ^ 

^) \ u = u au ,u,u eA ) = |(v~,v ^) \ v = v av ,v,v €A j. 
Proposition 10. Let A = {ai , . . . ,arf} C X and let 



: A* ^ (A7~ O A7~) X • • • X (A7~ O A7~) {d times ) 
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be given by 

Then (A*) is isomorphic to A*/~i, i.e. to the free monoid in BPoli (V) over the alphabet A. 

In particular, if the cardinality of A* is n, then the size of A* is bounded by the number nl!^^ . 

Proof. The first part follows immediately from Result |9l To get the estimate, realize that all the diagonal 
entries in the matrices \ia^ (m) , . . . , /Iq^ (m), for a given m G A*, are the same. □ 

Let us consider the simplest non-trivial example. It shows, among others, that the estimate from the 
last proposition can be far from being optimal. 

Example 11. Let V = SL - the class of all semilattices. Then m ~ v if and only if c(u) = c(v). The free 
semilattice (in the signature of monoids) over a set A C X is isomorphic to M = {2^,U). In particular, 
this variety is locally finite. For the corresponding variety of languages Y and a finite alphabet A, the set 
y (A) consists of unions of B's, B C A. 

Let A = {a,b}. We are going to improve the bound 4 • 2^'^" from the last proposition. Clearly, the 
cardinality of MOM is 2^". We will calculate the image of Ha first. 

We write also, for ai, . . . ,ayt € A, h(ai . . .a^) = a\ and t{a\ . . .a^^) = aj^. Let u = u^auia. . . auj^ where 
uq,.. . ,Uk G {b}*. The characteristic sequence char(M) of u is 

( (c(Mo),c(Mia . . .auk)), {c{uoaui),c{u2a . . .aut)),. ■ ■ , {c{uoa . . .auk-\)-,c{uk)) ) 

with removed repetitions. We get (/ia(M))i,2 when considering it as a set. Note that i = 

{lJLa{u))22 = for each u G A*. We divide the elements of A* into several classes: 

(i) For M = 1 we have c(m) = 0, char(M) = 1 (the sequence of length 0). 

(ii) For u = b'^ , k > I, we have c{u) = {b}, char(M) = 1. 

(iii) For u = a^,k>\, we have c(m) = {a} and char(a) = ((0,0)), char(a^) = ((0, {a}), ({a},0)), 
and char(a*) = ((0,{a}),({a},{a}),({a},0)) if /:>3. 

All remaining words have c{u) = A. 

(iv) If \u\a = 1, then char(M) is one of the following sequences ((0, {b})), (({Zj},0)), {{{b}^{b})). 
All remaining words have \u\a > 2. 

(v) If h{u) = a, t{u) = b, ba not being a subword of u, i.e. u = a^b\ k > 2, £ > \. Then either 
char(M) = ( (0,A), {{a},{b}) ) for it = 2 or char(M) = ( (0,A), {{a},A),{{a}, {b}) ) for A: > 3. 

(vi) The case u = b^a^, k >2, i > I is left-right dual to (v). 

(vii) If h(M) = a, t{u) = b, ba being a subword of u, then char(M) is a subsequence of 

((0,A),({a},A),(A,A),(A,W)) 

containing the first and the last item. The following words witness that all 4 possibilities can happen: 
abab, ababab, aabab, aababab. 

(viii) The case left-right dual to (vii). 

(ix) If h{u) = t{u) = a, then char(M) is a subsequence of 

((0,A),({a},A),(A,A),(A,M),(A,0)) 

containing the first and the last item. The following words witness that all 8 possibilities can happen: 
aba , aaba , abaa , ababa , aabaa , aababa , ababaa , aababaa . 
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(x)If h(M) =t(M) =Z7,thenchar(M) = {{{b},A),{A,{b})) or char(M) = {{{b},A),{A,A),{A,{b})). 
Appropriate words are baab and baaab. 

Altogether we have 30 elements in }JLa{A*). In fact our consideration until now could be presented in 
Section 3. Returning to the free monoid in the variety corresponding to the class BPoli(SL) over A, we 
can state at present only that it has at most 30 • 30 elements. When considering the mapping , not all 
possible 900 combinations can happen and we can further decrease the estimate for |(^ (A*) |. Using more 
advanced techniques we can even get 100 as an upper bound. 

Acknowledgement. The authors would hke to express their gratitude to Janusz Brzozowski who sug- 
gested them to use the construction from 161 in the proof of Proposition 2. 
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